Abstract



A search for pair production of first-generation scalar leptoquarks is performed in
the final state containing two electrons and two jets using proton-proton collision
data at $\sqrt{s} = 7$ TeV. The data sample used corresponds to an integrated luminosity of
33 pb$^{-1}$ collected with the CMS detector at the CERN LHC. The number of observed
events is in good agreement with the predictions for the standard model background
processes, and an upper limit is set on the leptoquark pair production cross section
times $\beta^2$ as a function of the leptoquark mass, where $\beta$ is the branching fraction of the
leptoquark decay to an electron and a quark. A 95% confidence level lower limit is
set on the mass of a first-generation scalar leptoquark at 384 GeV for $\beta = 1$, which is
the most stringent direct limit to date.

Although the standard model (SM) of fundamental particles and their interactions is in
excellent agreement with most collider data, there are compelling reasons to believe new
physics should appear at high energy scales. Some well-motivated theories of physics beyond
the SM, including grand unified theories [1], composite models [2], technicolor [3-5], and
superstring-inspired $E_6$ models [6], postulate the existence of a symmetry, beyond that of
the SM, relating quarks and leptons and implying the existence of new bosons, called
leptoquarks (LQ). An LQ carries colour, has fractional electric charge, can have spin 0
(scalar) or spin 1 (vector), and couples to a lepton and a quark with coupling strength
$\lambda$. An LQ would decay to a charged lepton and a quark, with an unknown branching
fraction $\beta$, or a neutrino and a quark, with branching fraction $1 - \beta$. A review of LQ
phenomenology and searches can be found in [7, 8]. Constraints from experiments sensitive to
flavour-changing neutral currents, lepton-family-number violation, and other rare processes
[9] favour LQs that couple to quarks and leptons within the same SM generation, for LQ masses
accessible at current colliders. The first-generation scalar LQs studied in this Letter
couple only to an electron or an electron neutrino and a light quark. Measurements at
electron-proton colliders constrain the coupling $\lambda$ to be comparable to or less than the
electromagnetic coupling $\lambda_{em} = \sqrt{4\pi\alpha_{em}} \approx 0.3$, for a
first-generation LQ mass, $M_{LQ}$, less than 300 GeV [10, 11]. Prior to this work, the D0
Collaboration set the most stringent limit for a broad range of the coupling $\lambda$ on the
mass of the first-generation scalar LQ, namely, $M_{LQ} > 299$ GeV for $\beta = 1$ [12].

This Letter presents the results of a search for pair production of first-generation scalar
LQs using events containing two electrons and two jets from a data sample of pp collisions at
$\sqrt{s} = 7$ TeV collected in 2010 with the Compact Muon Solenoid (CMS) detector at the LHC.
The data sample corresponds to an integrated luminosity of 33.2 ± 3.7 pb$^{-1}$. In pp
collisions at this energy, LQs are predominantly produced in pairs via gluon-gluon fusion and
quark-antiquark annihilation with a cross section that depends on the strong coupling constant
but is nearly independent of $\lambda$. This cross section depends on the spin and the mass of
the LQ and has been calculated including Next-to-Leading-Order (NLO) Quantum Chromodynamics
(QCD) corrections [13]. In this study we did not consider possible contributions from single
LQ production, which has a cross section that is dependent on $\lambda$.

The CMS experiment [14] uses a right-handed coordinate system, with the origin at the nominal
interaction point, the x-axis pointing to the center of the LHC ring, the y-axis pointing up
(perpendicular to the LHC plane), and the z-axis along the anticlockwise-beam direction. The
polar angle, $\theta$, is measured from the positive z-axis and the azimuthal angle, $\phi$,
is measured in the x-y plane. The pseudorapidity is given by
$\eta = -\ln[\tan(\theta/2)]$. The central feature of the CMS apparatus is a superconducting
solenoid, of 6 m internal diameter, providing a field of 3.8 T. Within the field volume are
the silicon pixel and strip tracker, the crystal electromagnetic calorimeter (ECAL), which
includes a silicon sensor preshower detector in front of the ECAL endcaps, and the
brass/scintillator hadron calorimeter (HCAL). Muons are measured in gas-ionization detectors
embedded in the steel return yoke. In addition to the barrel and endcap detectors, CMS has
extensive forward calorimetry. The ECAL has an ultimate energy resolution of better than 0.5%
for unconverted photons with transverse energies above 100 GeV. The energy resolution is 3% or
better for the range of electron energies relevant for this analysis. The HCAL, when combined
with the ECAL, measures jets with a resolution
$\Delta E/E \approx 100\%/\sqrt{E[\mathrm{GeV}]} \oplus 5\%$. The inner tracker measures
charged particles within $|\eta| < 2.5$ and provides an impact parameter resolution of about
15 $\mu$m and a transverse momentum ($p_T$) resolution of about 1.5% for 100 GeV particles.
The relative luminosity is measured using the forward calorimeters. Collision events were
selected by a first level trigger made of a system of fast electronics and a higher level
trigger that consists of a farm of commercial CPUs running a version of the offline
reconstruction optimized for fast processing.
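
As a small numerical illustration of the detector quantities defined above, the following Python sketch evaluates the pseudorapidity and the quoted jet energy resolution. The function names are illustrative only and are not taken from any CMS software; the $\oplus$ symbol is interpreted as a sum in quadrature.

```python
import math

def pseudorapidity(theta: float) -> float:
    """eta = -ln[tan(theta/2)] for a polar angle theta in radians (0 < theta < pi)."""
    return -math.log(math.tan(theta / 2.0))

def polar_angle(eta: float) -> float:
    """Inverse relation: theta = 2 * atan(exp(-eta))."""
    return 2.0 * math.atan(math.exp(-eta))

def jet_relative_resolution(energy_gev: float) -> float:
    """Delta E / E ~ 100%/sqrt(E[GeV]) (+) 5%, with (+) read as a quadrature sum."""
    stochastic = 1.00 / math.sqrt(energy_gev)  # 100% / sqrt(E) term
    constant = 0.05                            # 5% constant term
    return math.hypot(stochastic, constant)

if __name__ == "__main__":
    # Tracker acceptance |eta| < 2.5 expressed as a polar angle in degrees.
    print(math.degrees(polar_angle(2.5)))
    # Relative jet energy resolution at E = 100 GeV.
    print(jet_relative_resolution(100.0))
```

With these definitions a particle emitted perpendicular to the beam ($\theta = \pi/2$) has $\eta = 0$, and a 100 GeV jet has a relative resolution of roughly 11%.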

Events used in this analysis are collected with an efficiency of 100% by single and double
electron triggers with various thresholds depending on the instantaneous luminosity. Offline,
events are required to contain at least one primary vertex with z-position within 24 cm of the
nominal center of the detector. Electron candidates are required to have an electromagnetic
cluster in ECAL that is spatially matched to a reconstructed track in the central tracking
system in both $\eta$ and $\phi$. Electron candidates are further required to have a shower
shape consistent with that of an electromagnetic shower, have a ratio between the hadronic and
electromagnetic energy of less than 5%, and be isolated within a cone
$\sqrt{\Delta\eta^2 + \Delta\phi^2} < 0.3$ from other energy deposits in the calorimeter and
from additional reconstructed tracks (beyond the matched track) in the central tracking
system. More information about electron triggering and identification at CMS can be found
elsewhere [15]. Jets, the experimental signature of the hadronization of partons, are
reconstructed in this analysis from calorimetry information by the anti-$k_T$ algorithm [16]
with the distance parameter set to 0.5. The energy response of the jets is adjusted by
applying a correction determined from Monte Carlo (MC) simulated events and a residual
correction derived from data by analyzing the $p_T$ balance in di-jet events [17].

The collision data were compared to samples of MC generated events, where the response of
the detector was simulated using GEANT4 [18]. The selection procedure as well as the electron
and jet reconstructions described for the data are also applied to the MC simulation samples.
Signal samples for LQ masses from 200 to 500 GeV were generated with PYTHIA [19], version
6.422, tune D6T [20, 21]. An initial sample containing at least two electrons and at least two
jets is selected. The dominant SM processes that produce such events are $Z/\gamma$+jets and
$t\bar{t}$, which are simulated, respectively, using ALPGEN [22] and MadGraph [23, 24]
interfaced with PYTHIA for parton showering and hadronization. Other backgrounds include
multijet production with two jets misidentified as electrons and W+jets events with one jet
misidentified as an electron. There is also a small contribution from di-boson and single top
production. The two leading (in $p_T$) electrons and two leading jets are used in the analysis
and, to reduce the backgrounds, required to have $p_T > 30$ GeV. Selected electrons and jets
have pseudorapidities $|\eta| < 2.5$ and $|\eta| < 3.0$, respectively, and if any of the
selected electrons are closer than $\Delta R = \sqrt{\Delta\eta^2 + \Delta\phi^2} = 0.7$ to
any of the selected jets, the event is rejected. In addition, the preliminary requirements
$M_{ee} > 50$ GeV and $S_T > 250$ GeV are applied, where $M_{ee}$ is the di-electron invariant
mass and $S_T$ is defined as the sum of the magnitudes of the $p_T$ of the two leading
electrons and two leading jets. At this stage of the selection, referred to as pre-selection,
there are sufficient data to compare with the MC predictions. Good data-MC agreement is
observed in the shape of all kinematic distributions of the selected electrons and jets.
Fig. 1 shows the $M_{ee}$ and $S_T$ distributions. The $Z/\gamma$+jets MC distributions have
been normalized to the data at the Z boson mass, as described later.
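
The pre-selection requirements listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Particle` class and all function names are hypothetical, electrons are treated as massless when computing $M_{ee}$, and trigger, vertex, and identification requirements are not modelled.

```python
import math
from dataclasses import dataclass

@dataclass
class Particle:
    pt: float   # transverse momentum in GeV
    eta: float  # pseudorapidity
    phi: float  # azimuthal angle in radians

def delta_phi(phi1: float, phi2: float) -> float:
    """Azimuthal difference wrapped into (-pi, pi]."""
    d = (phi1 - phi2) % (2.0 * math.pi)
    return d - 2.0 * math.pi if d > math.pi else d

def delta_r(a: Particle, b: Particle) -> float:
    """Delta R = sqrt(Delta eta^2 + Delta phi^2)."""
    return math.hypot(a.eta - b.eta, delta_phi(a.phi, b.phi))

def dielectron_mass(e1: Particle, e2: Particle) -> float:
    """Invariant mass of two objects, treating both as massless (an approximation)."""
    m2 = 2.0 * e1.pt * e2.pt * (math.cosh(e1.eta - e2.eta) - math.cos(e1.phi - e2.phi))
    return math.sqrt(max(m2, 0.0))

def leading_two(objs):
    """The two highest-pT objects of a collection."""
    return sorted(objs, key=lambda o: o.pt, reverse=True)[:2]

def scalar_sum_pt(electrons, jets) -> float:
    """S_T: scalar sum of the pT of the two leading electrons and two leading jets."""
    return sum(o.pt for o in leading_two(electrons) + leading_two(jets))

def passes_preselection(electrons, jets) -> bool:
    els, js = leading_two(electrons), leading_two(jets)
    if len(els) < 2 or len(js) < 2:
        return False
    if any(e.pt <= 30.0 or abs(e.eta) >= 2.5 for e in els):
        return False
    if any(j.pt <= 30.0 or abs(j.eta) >= 3.0 for j in js):
        return False
    # Reject the event if any selected electron is within Delta R < 0.7 of a selected jet.
    if any(delta_r(e, j) < 0.7 for e in els for j in js):
        return False
    return dielectron_mass(*els) > 50.0 and scalar_sum_pt(els, js) > 250.0
```

The wrap-around in `delta_phi` matters for objects on either side of $\phi = \pm\pi$; without it, nearby electron-jet pairs could be assigned an artificially large $\Delta R$.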

To reduce the background from $Z \to ee$ production, a minimal value of $M_{ee}$ well above
the mass of the Z boson is required, and, to reduce all SM backgrounds, $S_T$ is required to
be large. While the LQ signal is expected to appear as a peak in the mass distribution of the
electron-jet pairs, we find that the $S_T$ variable is more powerful with the present
statistics as it is not affected by combinatorics. The minimal values required for $M_{ee}$
and $S_T$ were optimized by minimizing the expected upper limit on the leptoquark cross
section in the absence of an observed signal using a Bayesian approach [25, 26] that is well
suited for counting experiments in the Poisson regime. The optimized lower value of $M_{ee}$
is found to be 125 GeV for all the LQ hypotheses under test, while the lower value of $S_T$
varies as indicated in Table 1. Table 1 shows the number of surviving events for MC signal,
MC background and data samples after applying the full optimized selection. The reported
product of signal selection efficiency and acceptance is estimated from MC simulated events.
The product of the di-electron efficiency and acceptance, prior to any $M_{ee}$ and jet
requirements, varies from 58.7% to 68.0% for LQ masses from 200 to 500 GeV.

Figure 1: On the left: the $M_{ee}$ distribution for events that have passed the
pre-selection requirements. On the right: the $S_T$ distribution for events that have passed
the pre-selection requirements, except the pre-selection requirement on $S_T$ itself
($S_T > 250$ GeV), and have $M_{ee} > 125$ GeV. The MC distributions for the signal
($\beta = 1$) and the contributing backgrounds are shown. The $Z/\gamma$+jets MC has been
normalized as described in the text. Other backgrounds include W+jets, di-boson, and single
top. All background histograms are cumulative.

Table 1: Number of events for MC LQ signal (for $\beta = 1$), MC background, and data samples
after the full analysis selection and corresponding to an integrated luminosity of
33.2 pb$^{-1}$. The product of signal acceptance and efficiency is also reported for different
LQ masses. The $Z/\gamma$+jets MC has been normalized to the data at the Z boson mass as
described in the text. Other backgrounds include W+jets, di-boson, and single top.
Uncertainties are statistical. Systematic uncertainties are discussed later. The observed and
expected 95% C.L. upper limits (u.l.) on the leptoquark pair production cross section
$\sigma$ are shown in the last column.

$M_{LQ}$ [GeV]       Signal samples (MC)            Standard model background samples (MC): selected events     Events    Obs./Exp. 95% C.L.
($S_T$ cut [GeV])    Selected      Acceptance       $t\bar{t}$+jets   $Z/\gamma$+jets   Others       All         in data   u.l. on $\sigma$ [pb]
                     events        × efficiency
200 ($S_T$ > 340)    117.5±0.8     0.297±0.002      2.2±0.09          2.0±0.2           0.27±0.05    4.5±0.2     2         0.445 / 0.692
250 ($S_T$ > 400)    43.8±0.2      0.380±0.002      1.1±0.06          1.3±0.1           0.14±0.02    2.5±0.1     1         0.311 / 0.442
280 ($S_T$ > 450)    24.4±0.1      0.403±0.002      0.59±0.05         0.87±0.07         0.10±0.02    1.6±0.1     1         0.309 / 0.361
300 ($S_T$ > 470)    17.3±0.1      0.430±0.002      0.44±0.04         0.75±0.07         0.10±0.02    1.3±0.1     1         0.294 / 0.326
320 ($S_T$ > 490)    12.3±0.1      0.451±0.002      0.37±0.04         0.65±0.07         0.08±0.02    1.1±0.1     1         0.285 / 0.299
340 ($S_T$ > 510)    8.88±0.04     0.469±0.002      0.27±0.03         0.56±0.06         0.08±0.02    0.91±0.08   1         0.279 / 0.274
370 ($S_T$ > 540)    5.55±0.02     0.496±0.002      0.22±0.03         0.47±0.06         0.07±0.02    0.76±0.07   1         0.268 / 0.251
400 ($S_T$ > 560)    3.55±0.02     0.522±0.002      0.17±0.02         0.41±0.05         0.06±0.02    0.64±0.06   1         0.259 / 0.230
450 ($S_T$ > 620)    1.70±0.01     0.539±0.002      0.10±0.02         0.28±0.05         0.02±0.01    0.40±0.06   0         0.174 / 0.209
500 ($S_T$ > 660)    0.868±0.003   0.565±0.002      0.07±0.02         0.23±0.05         0.02±0.01    0.32±0.05   0         0.166 / 0.193

The $Z/\gamma$+jets background dominates the pre-selection sample. After the pre-selection,
the ratio between data and MC events with $80 < M_{ee} < 100$ GeV (where the contamination
from other SM processes is 3%) is 1.20 ± 0.14. This ratio is used to normalize the
$Z/\gamma$+jets MC. The statistical uncertainty on this normalization factor is used as an
uncertainty on the MC estimate of the $Z/\gamma$+jets background after the full selection. The
$t\bar{t}$ background is estimated from MC with an uncertainty of 41%, taken from the
uncertainty on the CMS measurement of the $t\bar{t}$ cross section [27]. Since this
measurement is consistent with NLO predictions, no rescaling of the $t\bar{t}$ MC is applied.
The small contribution from other background processes containing vector bosons is estimated
by MC. The multijet background is determined from data. The probability that an isolated
electromagnetic cluster is reconstructed as an electron is measured in a background sample
requiring a single cluster, a jet multiplicity similar to the analysis final state, and small
missing transverse energy. This probability and a data sample with two or more of these
clusters and two or more jets were used to determine the multijet contribution to the final
selection sample. The resulting systematic uncertainty is determined to be 20%. This
background accounts for less than 1% of the total background for all LQ mass hypotheses, with
a decreasing trend for increasing LQ mass hypothesis, and is not considered any further.
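
To make the size of these normalization uncertainties concrete, the sketch below propagates the quoted relative uncertainties to the background yields of the $M_{LQ} = 300$ GeV selection in Table 1. The variable names are illustrative. The text does not state how the two contributions are combined, so both a linear and a quadrature sum are computed; this is an assumption-laden cross-check, not the authors' procedure.

```python
import math

# Numbers quoted in the text and in Table 1 (M_LQ = 300 GeV selection).
Z_SCALE, Z_SCALE_ERR = 1.20, 0.14   # data/MC ratio in 80 < M_ee < 100 GeV
TT_REL_ERR = 0.41                   # relative uncertainty on the ttbar estimate
N_Z, N_TT, N_ALL = 0.75, 0.44, 1.3  # Z/gamma+jets (already rescaled), ttbar, all backgrounds

def normalization_uncertainties():
    """Relative Z normalization uncertainty and absolute uncertainties in events."""
    z_rel = Z_SCALE_ERR / Z_SCALE
    dz = z_rel * N_Z          # events
    dtt = TT_REL_ERR * N_TT   # events
    return z_rel, dz, dtt

z_rel, dz, dtt = normalization_uncertainties()
linear_fraction = (dz + dtt) / N_ALL               # fully correlated combination
quadrature_fraction = math.hypot(dz, dtt) / N_ALL  # uncorrelated combination

if __name__ == "__main__":
    print(f"Z normalization: {100 * z_rel:.1f}%")
    print(f"linear: {100 * linear_fraction:.1f}%, quadrature: {100 * quadrature_fraction:.1f}%")
```

The relative uncertainty on the Z normalization is about 12%. The linear combination comes out close to 21% of the total background, which matches the background normalization entry of Table 2, but the agreement should be read as a plausibility check only.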

The systematic uncertainties affecting the number of expected signal and background events
are summarized in Table 2. The jet and electron energy scale uncertainties are given in the
second column of Table 2. The reconstruction, identification, and isolation efficiency for
electrons is determined from MC simulated events and a systematic uncertainty is assessed
using $Z \to ee$ events from collision data. The statistical uncertainty on the number of MC
events surviving the full event selection is reported in Table 1 for the signal and
background. The uncertainty on the integrated luminosity of the data sample is dominated by
the uncertainty on the measurement of the beam current [28]. Uncertainties due to the choice
of parton distribution functions (PDF) of the proton lead to changes in the total cross
section and the acceptance for both signal and background processes. The effect of the PDF
uncertainties on the signal acceptance is estimated using an event re-weighting technique that
uses the LHAPDF package [29] and amounts to 0.1%. Since a background normalization uncertainty
is assessed based on data, uncertainties due to the PDF choice, electron efficiencies, and
integrated luminosity are not applicable to the background estimate.

Table 2: Summary of the systematic uncertainties affecting the number of signal and
background events for the hypothesis of LQ with mass of 300 GeV.

Systematic uncertainty                  Magnitude [%]   Effect on signal [%]   Effect on background [%]
Background normalization uncertainty    -               -                      21
Jet energy scale                        5               3                      11
Electron energy scale barrel/endcap     1/3             1                      5
Electron pair reco, ID, iso             10              10                     -
MC statistics                           -               1                      6
Integrated luminosity                   11              11                     -
Total                                   -               15                     25
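
The totals in Table 2 can be cross-checked with a short calculation. The sketch below assumes that the individual sources are independent and are combined in quadrature; the table itself does not state the combination rule, but this assumption reproduces the quoted totals after rounding. The dictionary keys are shortened labels chosen for illustration.

```python
import math

# Effects in percent, copied from Table 2 (M_LQ = 300 GeV hypothesis).
SIGNAL_EFFECTS = {
    "jet energy scale": 3,
    "electron energy scale": 1,
    "electron pair reco/ID/iso": 10,
    "MC statistics": 1,
    "integrated luminosity": 11,
}
BACKGROUND_EFFECTS = {
    "background normalization": 21,
    "jet energy scale": 11,
    "electron energy scale": 5,
    "MC statistics": 6,
}

def total_in_quadrature(effects: dict) -> float:
    """Quadrature sum of independent percentage effects."""
    return math.sqrt(sum(value ** 2 for value in effects.values()))

if __name__ == "__main__":
    print(f"signal total:     {total_in_quadrature(SIGNAL_EFFECTS):.1f}%")
    print(f"background total: {total_in_quadrature(BACKGROUND_EFFECTS):.1f}%")
```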



The number of observed events in the collision data sample that pass the selection criteria
optimized for each LQ mass considered is consistent with the prediction from SM processes, as
reported in Table 1. An upper limit on the LQ cross section in the absence of signal is
therefore set using a Bayesian approach [26] that uses a Poisson likelihood, a flat prior for
the signal cross section, and log-normal priors for the parameters used to model the
systematic uncertainties. Systematic uncertainties for the signal are dominated by the
uncertainty on the integrated luminosity and the electron selection efficiencies, while the
systematic uncertainties for the background are dominated by the uncertainty derived from
data. Fig. 2 (left) shows the 95% Confidence Level (C.L.) upper limit on the LQ pair
production cross section times $\beta^2$ as a function of the leptoquark mass for
33.2 pb$^{-1}$ of integrated luminosity. The systematic uncertainties reported in Table 2 are
included in the calculation. The upper limits are compared to an NLO prediction of the LQ pair
production cross section [13] to set a 95% C.L. exclusion on LQ masses smaller than 384 GeV
(expected 391 GeV), assuming $\beta = 1$. A theoretical uncertainty on the signal production
cross sections due to the choice of renormalization/factorization scales (14-15% for all LQ
masses considered) has been calculated by varying the scales between half and twice the LQ
mass, while a 90% C.L. PDF uncertainty (from 8 to 22% for LQ masses from 200 to 500 GeV) has
been obtained from the CTEQ6.6 error PDF set following the standard prescription detailed in
Ref. [30]. If the observed cross section upper limit is compared with the lower boundary of
the cross section uncertainty band, the lower limit on the LQ mass for $\beta = 1$ becomes
370 GeV (expected 375 GeV). Fig. 2 (right) shows the minimum $\beta$ for a 95% C.L. exclusion
of the LQ hypothesis as a function of LQ mass.
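
The core of such a limit calculation can be sketched for a single counting experiment. The Python code below is a simplified illustration, not the procedure of Ref. [26]: it uses a Poisson likelihood and a flat prior on the signal yield but treats the background, efficiency, and luminosity as exactly known, so the log-normal nuisance priors are omitted. The function names are hypothetical.

```python
import math

def poisson_cdf(n: int, mu: float) -> float:
    """P(N <= n) for a Poisson distribution with mean mu."""
    term = math.exp(-mu)
    total = term
    for k in range(1, n + 1):
        term *= mu / k
        total += term
    return total

def bayes_upper_limit(n_obs: int, bkg: float, cl: float = 0.95, tol: float = 1e-9) -> float:
    """Upper limit on the signal yield s for a flat prior on s >= 0.

    The posterior tail probability above s_up is
    P(N <= n_obs | s_up + bkg) / P(N <= n_obs | bkg); it is set to 1 - cl
    and solved for s_up by bisection.
    """
    target = (1.0 - cl) * poisson_cdf(n_obs, bkg)
    lo, hi = 0.0, 10.0
    while poisson_cdf(n_obs, hi + bkg) > target:
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if poisson_cdf(n_obs, mid + bkg) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def cross_section_limit(n_obs: int, bkg: float, eff: float, lumi_pb: float, cl: float = 0.95) -> float:
    """Convert the yield limit into a cross section limit in pb."""
    return bayes_upper_limit(n_obs, bkg, cl) / (eff * lumi_pb)

if __name__ == "__main__":
    # Table 1 inputs for M_LQ = 300 GeV: 1 observed event, 1.3 expected background events,
    # acceptance x efficiency 0.430, integrated luminosity 33.2 pb^-1.
    print(cross_section_limit(1, 1.3, 0.430, 33.2))
```

For the $M_{LQ} = 300$ GeV inputs this simplified calculation gives a limit of about 0.28 pb, slightly below the 0.294 pb of Table 1, as expected when the systematic uncertainties are left out.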

Figure 2: On the left: the expected and observed upper limit at 95% C.L. on the LQ pair
production cross section times $\beta^2$ as a function of the LQ mass. The systematic
uncertainties reported in Table 2 are included in the calculation. The shaded region is
excluded by the current D0 limit for $\beta = 1$. The $\sigma_{theory}$ curve and its band
represent, respectively, the theoretical LQ pair production cross section and the
uncertainties due to the choice of PDF and renormalization/factorization scales [13]. On the
right: minimum $\beta$ for a 95% C.L. exclusion of the LQ hypothesis as a function of LQ mass.
The observed (expected) exclusion curve is obtained using the observed (expected) upper limit
and the central value of the theoretical LQ pair production cross section. The band around the
observed exclusion curve is obtained by considering the observed upper limit while taking into
account the uncertainties on the theoretical cross section. The shaded region is excluded by
the current D0 limits, which combine results from searches in the two electron,
electron-neutrino, and two neutrino channels.
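
The construction of the right-hand panel can be illustrated numerically. Because the limit is set on $\sigma \times \beta^2$, the smallest excluded branching fraction at a given mass is $\beta_{min} = \sqrt{\sigma_{limit}/\sigma_{theory}}$. The sketch below is an approximation under a stated assumption: the theoretical cross section is not tabulated in the text, so it is inferred from the Table 1 signal yield as $N/(\epsilon L)$. The names are illustrative.

```python
import math

LUMI_PB = 33.2  # integrated luminosity in pb^-1

# From Table 1: (selected signal events for beta = 1, acceptance x efficiency,
# observed 95% C.L. upper limit on the cross section in pb).
TABLE1 = {
    300: (17.3, 0.430, 0.294),
    400: (3.55, 0.522, 0.259),
}

def implied_cross_section(n_signal: float, eff: float, lumi_pb: float = LUMI_PB) -> float:
    """Cross section implied by the MC signal yield: sigma = N / (eff * L)."""
    return n_signal / (eff * lumi_pb)

def minimum_beta(sigma_limit: float, sigma_theory: float):
    """Smallest excluded beta, or None if even beta = 1 is not excluded."""
    ratio = sigma_limit / sigma_theory
    return math.sqrt(ratio) if ratio <= 1.0 else None

if __name__ == "__main__":
    for mass, (n_sig, eff, limit) in TABLE1.items():
        sigma_th = implied_cross_section(n_sig, eff)
        print(mass, round(sigma_th, 3), minimum_beta(limit, sigma_th))
```

At 300 GeV this gives $\beta_{min}$ close to 0.5, while at 400 GeV the observed limit exceeds the implied cross section, so no value of $\beta$ is excluded; both are consistent with the 384 GeV mass limit quoted for $\beta = 1$.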

In conclusion, a search for pair production of first-generation scalar leptoquarks has been
presented. The number of collision events, passing a selection optimized for exclusion of the
LQ hypothesis, is in good agreement with the predictions for the SM background processes. A
Bayesian approach that includes the treatment of the systematic uncertainties as nuisance
parameters has been used to set an upper limit on the LQ cross section. By comparing this
upper limit to a theoretical calculation of the LQ pair production cross section, the
existence of first-generation scalar LQs with masses below 384 GeV for $\beta = 1$ has been
excluded at 95% C.L., with a corresponding cross section limit of 0.267 pb. The lower limits
on the LQ mass set for values of $\beta$ larger than about 0.4 are the most restrictive direct
limits to date.

We wish to
[...] MoST, and NSFC (China); COLCIENCIAS (Colombia); MSES (Croatia); RPF (Cyprus); [...]
and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRT (Greece); OTKA and
NKTH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); NRF and [...]